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Abstract 

Can we develop a computer algorithm that assesses the creativity of a 
painting given its context within art history? This paper proposes a novel 
computational framework for assessing the creativity of creative products, 
such as paintings, sculptures, poetry, etc. We use the most common def¬ 
inition of creativity, which emphasizes the originality of the product and 
its influential value. The proposed computational framework is based on 
constructing a network between creative products and using this network 
to infer about the originality and influence of its nodes. Through a series 
of transformations, we construct a Creativity Implication Network. We 
show that inference about creativity in this network reduces to a variant of 
network centrality problems which can be solved efficiently. We apply the 
proposed framework to the task of quantifying creativity of paintings (and 
sculptures). We experimented on two datasets with over 62K paintings 
to illustrate the behavior of the proposed framework. We also propose a 
methodology for quantitatively validating the results of the proposed algo¬ 
rithm, which we call the “time machine experiment”. 


1 Introduction 

The field of eomputational ereativity is foeused on giving the maehine the ability to gen¬ 
erate human-level “ereative” produets sueh as eomputer generated poetry, stories, jokes, 
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music, art, etc., as well as creative problem solving. An important eharaeteristic of a ere- 
ative agent is its ability to assess its ereativity as well as judge other agents’ ereativity. In 
this paper we focus on developing a computational framework for assessing the creativity 
of produets, such as painting, sculpture, etc. We use the most common definition of ere- 


ativity, whieh emphasizes the originality of the product and its influential value [Paul and 


Kaufman 2014b|. In the next seetion we justify the use of this definition in eontrast to 


other definitions. The proposed eomputational framework is based on eonstrueting a net¬ 
work between products and using it to infer about the originality and influenee of its nodes. 
Through a series of transformations, we show that the problem ean reduee to a variant of 
network centrality problems, whieh ean be solved effieiently. 

We apply the proposed framework to the task of quantifying ereativity of paintings 
(and seulptures). The reader might question the feasibility, limitation, and usefulness of 
performing sueh task by a maehine. Artists, art historians and erities use different concepts 
to deseribe pantings. In particular, elements of arts sueh as space, texture, form, shape, 
eolor, tone and line. Artists also use prineiples of art ineluding movement, unity, harmony, 
variety, balanee, eontrast, proportion, and pattern; besides brush strokes, subjeet matter, 
and other deseriptive eoneepts [Fiehner-Rathus 20081. We eollectively eall these eon- 
eepts artistic concepts. These artistie eoneepts ean, more or less, be quantified by today’s 
eomputer vision teehnology. With the rapid progress in eomputer vision, more advaneed 
techniques are introdueed, whieh can be used to measure similarity between paintings with 
respect to a given artistie concept. Whether the state of the art is already sufficient to mea¬ 
sure similarity in meaningful ways, or whether this will happen in the near or far future, the 
goal of this paper is to design a framework that ean use such similarity measures to quan¬ 
tify our chosen definition of creativity in an objective way. Hence, the proposed framework 
would provide a ready-to-use approaeh that can utilize any future advances in computer 
vision that might provide better ways for visual quantifieation of digitized paintings. In 
faet, we applied the proposed framework using state-of-the-art eomputer vision teehniques 
and aehieved very reasonable automatie quantifieation of ereativity on two large datasets 
of paintings. Figure illustrates an example of the ereativity seores obtained on dataset 
eontaining 1710 paintings. 

One of the fundamental issues with the problem of quantifying creativity of art is how 
to validate any results that the algorithm ean obtain. Even if art historians would agree 
on a list of highly original and influential paintings that can be used for validation, any 
algorithm that aims at assigning ereativity scores will encounter three major limitations: I) 
Closed-world limitation: The algorithm is only limited to the set of paintings it analyzed. 
It is a elosed world for the algorithm where this set is every thing it has seen about art 
history. The number of images of paintings available in the publie domain is just a small 
fraetion of what are in museums and private eolleetions. II) Artistie eoneept quantification 
limitation: the algorithm is limited by what it sees, in terms of the ability of the underlying 
eomputer vision methods to eneode the important elements and prineiples of art that relates 
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Figure 1: Creativity seores for 1710 paintings from Artehive dataset. Eaeh point represents 
a painting. The horizontal axis is the year the painting was ereated and the vertieal axis 
is the ereativity seore (sealed). The thumbnails illustrate some of the painting that seored 
relatively high or low eompared to their neighbors. Only artist names and dates of the 
paintings are shown on the graph beeause of limited spaee. See Figure for a zoom in to 
the period 1850-1950 


to judging ereativity. Ill) Parameter setting: the results will depend on the setting of the 
parameters, where eaeh setting would mean a different way to assign ereativity seores with 
different interpretation and different eriteria. However, these limitations should not stop us 
from developing and testing algorithms to quantify ereativity. The first two limitations are 
bound to disappear in the future, with more and more paintings being digitized, as well as 
with the eontinuing advanees in eomputer vision and maehine learning. The third limitation 
should be thought of as an advantage, sinee the different settings mean a rieh ability of 
the algorithm to assign ereativity seores based on different eriteria. For the purpose of 
validation, we propose a methodology for validating the results of the algorithm through 
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what we denote as “time maehine experiments”, whieh provides evidence of the correctness 
of the algorithm. 

Having discussed the feasibility and limitations, let us discuss the value of using any 
computational framework to assess creativity in art. For a detailed discussion about the 
implications of using computational methods in the domain of aesthetic-judgment-related 
tasks, we refer the reader to [ Spratt and Elgammal[ 2014[ . Our goal is not to replace art 
historians’ or artists’ role in judging creativity of art products. Providing a computational 
tool that can process millions of artworks to provide objective similarity measures and 
assessments of creativity, given certain visual criteria can be useful in the age of digital 
humanities. From a computational creativity point of view, evaluating the framework on 
digitized art data provides an excellent way to optimize and validate the framework, since 
art history provides us with suggestions about what is considered creative and what might 
be less creative. In this work we did not use any such hints in achieving the creativity scores, 
since the whole process is unsupervised, i.e., the approach does not use any creativity, 
genre, or style labels. However we can use evidence from art history to judge whether the 
results make sense or not. Validating the framework on digitized art data makes it possible 
to be used on other products where no such knowledge is available, for example to validate 
computer-generated creative products. 


2 On the Notion of Creativity 


There is a historically long and ongoing debate on how to define creativity. In this section 
we give a brief description of some of these definitions that directly relate to the notion we 
will use in the proposed computational framework. Therefore, this section is by no means 


intended to serve as a comprehensive overview of the subject. We refer readers to [Taylor 


1988 Paul and Kaufman] 2014a j for comprehensive overviews of the different definitions 


of creativity. 

We can describe a person (e.g. artist, poet), a product (painting, poem), or the mental 
process as being creative [Taylor 1988[ Paul and Kaufman] 2014b |. Among the various 
definitions of creativity it seems that there is a convergence to two main conditions for a 
product to be called “creative”. That product must be novel, compared to prior work, and 
also has to be of value or influential [ [Paul and Kaufmanj [2014b| . These criteria resonate 
with Kant’s definition of artistic genius, which emphasizes two conditions “originality” 
and being “exemplary” Q Psychologists would not totally agree with this definition since 


' Among four criteria for artistic genius suggested by Kant, two describe the characteristic of a creative 
product “That genius 1) is a talent for producing that for which no determinate rule can be given, not a pre¬ 
disposition of skill for that which can be learned in accordance with some rule, consequently that originality 
must be it’s primary characteristic. 2) that since there can also be original nonsense, its products must at the 
same time be models, i.e., exemplary, hence, while not themselves the result of imitation, they must yet serve 
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they favor associating creativity with the mental process that generates the product ]Taylor[ 
1988[ [Nanay 1 12014||. However associating creativity with products makes it possible to 


argue in favor of “Computational Creativity”, since otherwise, any computer product would 
be an output of an algorithmic process and not a result of a creative process. Hence, in this 
paper we stick to quantifying the creativity of products instead of the mental process that 
create the product. 

Boden suggested a distinction between two notions of creativity: psychological cre¬ 
ativity (P-creativity), which assesses novelty of ideas with respect to its creator, and his¬ 
torical creativity (H-creativity), which assesses novelty with respect to the whole human 
history [Boden 1990[. It follows that P-creativity is a necessary but not sufficient condi¬ 
tion for H-creativity, while H-creativity implies P-creativity [ Bodenj 1990[ Nanay 20141. 
This distinction is related to the subjective (related to person) vs. objective creativity (re¬ 
lated to the product) suggested by Jarvie |Jarvie[ [1986[ . In this paper our definition of 
creativity is aligned with objective/H-creativity, since we mainly quantify creativity within 
a historical context. 


3 Computational Framework 

According to the discussion in the previous section, a creative product must be original, 
compared to prior work, and valuable (influential) moving forward. Let us construct a 
network of creative products and use it to assign a creativity score to each product in the 
network according to the aforementioned criteria. In this section, for simplicity and without 
loss of generality, we describe the approach based on a network of paintings, however the 
framework is applicable to other art or literature forms. 


3.1 Constructing a Painting Graph 


Let us denote by P = {pj, i = 1 ■ ■ ■ A^} a set of paintings. The goal is to assign a creativity 
score for each painting, denoted by C(pi) for painting p* . Every painting comes with a 
time label indicating the date it was created, denoted by t(pi). We create a directed graph 
where each vertex corresponds to a painting. A directed edge (arc) connects painting pi to 
Pj if Pi was created before pj. Each directed edge is assigned a positive weight (we will 
discuss later where the weights come from), we denote the weight of edge (puPj) by Wij. 
We denote by Wij the adjacency matrix of the painting graph, where Wij = Wij if there is 
an edge from pi to pj and 0 otherwise. Note that according to this definition, a painting is 
not connected to itself, i.e., wu = 0,i = 1 ■ ■ ■ N. By construction, Wij > 0 —)■ Wji = 0, i.e., 
the graph is anti-symmetric. 


others in that way, i.e., as a standard or 


rule for judging.” | Guyer and Wood 2000|-pl86 
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Figure 2: Illustration of the construction of the Creativity Implication Network: blue ar¬ 
rows indicate temporal relation and orange arrows indicate reverse creativity implication 
(converse). 


To assign the weights we assume that there is a similarity function that takes two paint¬ 
ings and produces a positive scalar measure of affinity between them (higher value indicates 
higher similarity). We denote such a function by S'(-, •) and, therefore, 


Wij — 


S{pi,pj) if t{pi) <t{pj). 

0 otherwise. 


Since there are multiple possible visual aspects that can be used to measure similarity, we 
denote such a function by S'“(-, •) where the superscript a indicates the visual aspect that 
is used to measure the similarity (color, subject matter, brush stroke, etc.) This implies 
that we can construct multiple graphs, one for each similarity function. We denote the 
corresponding adjacency matrix by kF", and the induced creativity score by (7“, which 
measure the creativity along the dimension of visual aspect a. In the rest of this section, 
for the sake of simplicity, we will assume one similarity function and drop the superscript. 
Details about the similarity function will be explained in the next section. 


3.2 Creativity Propagation 

Giving the constructed painting graph, how can we propagate the creativity in such a net¬ 
work? To answer this question we need to understand the implication of the weight of the 
directed edge connecting two nodes on their creativity scores. Let us assume that initially 
we assign equal creativity indices to all nodes. Consider painting pt and consider an in¬ 
coming edge from a prior painting pk. A high weight on that edge (wki) indicates a high 
similarity between pi and pk, which indicates that pi is not novel, implying that we should 
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lower the ereativity seore of pi (sinee pi is subsequent to pk and similar to it) and inerease 
the ereativity seore of pk- In eontrast, a low weight implies that pi is novel and henee ere- 
ative eompared to pk, therefore we need to inerease the ereativity seore of Pi and deereases 
that of Pk- 

Let us now eonsider the outgoing edges from pi. Aeeording to our notion of ereativity, 
for Pi to be ereative it is not enough to be novel, it has to be influential as well (some others 
have to imitate it). This indieates that a high weight, Wij, between pi and a subsequent 
painting pj implies that we should inerease the ereativity seore of Pi and deerease that of 
Pj. In eontrast, a lower weight implies that Pi is not influential on pj, and henee we should 
deerease the seore for pi and inerease it for pj. These four eases are illustrated in Figure]^ 
A eareful look reveals that the two oases for the ineoming edges and those for the outgoing 
edges are in faet the same. A higher weight implies the prior node is more influential and 
the subsequent node is less creative, and a lower weight implies the prior node is less 
influential and the subsequent node is more creative. 

3.3 Creativity Implication Network 

Before eonverting this intuition to a oomputational approaeh, we need to define what is 
oonsidered high and low for weights. We introduee a balaneing funotion on the graph. Let 
m{i) denote a balaneing value for node i, where for the edges eonneoted to that node a 
weight above m{i) is eonsidered high and below that value is oonsidered low. We define a 
balaneing funotion as a linear funotion on the weights oonneoting to eaoh node in the form 

„ / . { w — mil) if to > 0. 

= I 0 otherwise. 

We oan think of different forms of balaneing funetions that ean be used. Also there are 
different ways to set the parameter m{i) with different implioations, whieh we will disouss 
in the next seotion. This form of balaneing funotion basioally eonverts weights lower than 
m{i) to negative values. The more negative the weight of an edge the more ereative the 
subsequent node and the less influential the prior node. The more positive the weight of an 
edge the less ereative the subsequent node and the more influential the prior node. 

The introduetion of the negative weights in the graph, despite providing a solution to 
represent low weights, is problematio when propagating the ereativity soores. The intuition 
is, a negative edge between pi and pj is equivalent to a positive edge between pj and pi. This 
direetly suggests that we should reverse all negative edges and negate their values. Notiee 
that the original graph eonstruetion guarantees that an edge between pi and pj implies no 
edge between pj and pi, therefore there is no problem with edge reversal. This proeess 
results in what we eall “Creativity Implication Network”. We denote the weights of that 
graph by Wij and its adjaeeney matrix by W. This proeess ean be deseribed mathematieally 
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as 


B{wij) > 0 -)■ Wij = B{wij) 

B{wij) = 0 —)■ Wij = 0 

B{wij) < 0 -)■ Wji = -B{wij) 


The Creativity Implieation Network has one simple rule that relates its weights to ereativity 
propagation: the higher the weight of an edge between two nodes, the less creative the 
subsequent node and the more creative the prior node. Note that the direetion of the edges 
in this graph is no longer related to the temporal relation between its nodes, instead it is 
direetly inverse to the way ereativity seores should propagate from one painting to another. 
Notiee that the weights of this graph are non-negative. 

3.4 Computing Creativity Scores 

Given the eonstruetion of the Creativity Implieation Network, we are now ready to define 
a reeursive formula for assigning ereativity seores. We will show that the eonstruetion of 
the Creativity Implieation Network reduees the problem of eomputing the ereativity seores 
to a traditional network eentrality problem. The algorithm will maintain ereativity seores 
that sum up to one, i.e., the ereativity seores form a probability distribution over all the 
paintings in our set. Given an initial equal ereativity seores, the ereativity seore of node Pi 
should be updated as 



( 1 ) 


where 0 < a < 1 and N{pj) = In this formula, the ereativity of node pi is 

eomputed from aggregating a fraetion a of the ereativity seores from its outgoing edges 
weighted by the adjusted weights Wij. The eonstant term (1 — a)/N refleets the ehanee 
that similarity between two paintings might not neeessarily indieate that the subsequent one 
is influeneed by the prior one. For example, two paintings might be similar simply beeause 
they follow a eertain style or art movement. The faetor 1 — a refleets the probability of 
this ehanee. The normalization term N{pj) for node j is the sum of its ineoming weights, 
whieh means that the eontribution of node pj is split among all its ineoming nodes based 
on the weights, and henee, Pi will eolleet only a fraetion Wij/'ffk "^kj of the ereativity seore 
of Pj. 

The reeursive formula in Eq[^oan be written in a matrix form as 



( 2 ) 


where W is a eolumn stoehastie matrix defined as Wij = Wij/J^k'^kj, and 1 is a veetor 
of ones of the same size as C. It is easy to see that sinee W, C, and 4l are all eolumn 
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stochastic, the resulting seores will always sum up to one. The ereativity seores ean be 
obtained by iterating over Eq[^ until eonversion. Also a closed-form solution for the ease 
where a 1 ean be obtained as 




( 3 ) 


A reader who is familiar with soeial network analysis literature might direetly see the 
relation between this formulation and some traditional network eentrality algorithms. Eq[^ 
represents a random walk in a Markov ehain. Setting a = 1, the formula in Eq|^beeomes 
a weighted variant to eigenveetor eentrality [ Borgatti and Ever^ 20061, where a solution 

ean be obtained by the right eigenveetor eorresponding to the largest eigenvalue of W. 
The formulation in Eq|^is also a weighted variant of Hubbell’s eentrality [ Hubbell[ 19651. 
Einally the formulation ean be seen as an inverted weighted variant of the Page Rank al¬ 


gorithm [Brin and Page, 19981. Notiee that this reduetion to traditional network centrality 


formulations was only possible beeause of the way the Creativity Implieation Network was 
eonstrueted. 


3.5 Originality vs. Influence 

The formulation above sums up the two eriteria of ereativity, being original and being 
influential. We ean modify the formulation to make it possible to give more emphasis to 
either of these two aspeets when computing the ereativity seores. Eor example it might be 
desirable to emphasize novel works even though they are not influential, or the other way 
around. Reeall that the direetion of the edges in Creativity Implieation Network are no 
longer related to the temporal relation between the nodes. We ean label (eolor) the edges 
in the network sueh that eaeh outgoing edge e{pi, pj) from a given node pi is either labeled 
as a subsequent edge or a prior edge depending on the temporal relation between pi and pj. 
This ean be aehieved by defining two disjoint subsets of the edges in the networks 

^pnor ^ {e{pi,pj) : t{pj) < t{pi)} 

^subseq ^ {e{pi,pj) : t{pj) > t{pi)} 


This results in two adjaoeney matriees, denoted by and IE® sueh that W = + IE®, 

where the superseripts p and s denote the prior and subsequent edges respeetively. Now 
Eq[^ean be rewritten as 


C{p^) 


(1 - «) 
N 



C{Pj 


NP{pj) 


+(I ~ /^) 


.. Cjp,] 

^^N^{pj 


(4) 
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where N^{pj) = J2k N^{pj) = J2k '’^kj- The first summation eolleets the ereativity 

seores stemming from prior nodes, i.e., eneodes the originality part of the seore, while the 
seeond summation eolleets ereativity seores stemming from subsequent nodes, i.e, eneodes 
influenee. We introdueed a parameter 0 < /3 < 1 to eontrol the effeet of the two eriteria on 
the result. The modified formulation above ean be written as 

C = + a[pWPC + {1 - ^)W^C], (5) 

where Wp and are the eolumn stoehastie adjaeeney matriees resulting from normalizing 
the eolumns of and respeetively. It is obvious that the elosed-form solution in Ecy^ 

is applieable to this modified formulation where W is defined as IT = /31 Tp + (1 — f3)W^. 


4 Creativity Network for Art 

In this seetion we explain how the framework ean be realized for the partieular ease of 
visual art. 

Visual Likelihood: For eaeh painting we ean use eomputer vision teehniques to obtain 
different feature representations for its image, eaeh eneoding a speeifie visual aspeet(s) 
related to the elements and prineiples of arts. We denote sueh features by /“ for painting 
Pi, where a denotes the visual aspeet that the feature quantifies. We define the similarity 
between painting pi and pj, as the likelihood that painting pj is eorning from a probability 
model defined by painting pi. In partieular, we assume a Gaussian probability density 
model for painting pt, i.e., 

S‘'{Pj,Pi) = Pr{pj\pi,a) =U{-]fi,a^I). 

It is important to limit the eonneetions eorning to a given painting. By eonstruetion, any 
painting will be eonneeted to all prior paintings in the graph. This makes the graph highly 
biased sinee modern paintings will have extensive ineoming eonneetions and early paint¬ 
ings will have extensive outgoing eonneetions. Therefore we limit the ineoming eonnee- 
tions to any node to at most the top K edges (the K most similar prior paintings). 

Temporal Prior: It might be desirable to add a temporal prior on the eonneetions. If 
a painting in the nineteenth eentury resembles a painting from the fourteenth eentury, we 
shouldn’t neeessarily penalize that as low ereativity. This is beeause eertain styles are 
always reinventions of older styles, for example neoelassieism and renaissanee. Therefore, 
these similarities between styles aeross distant time periods should not be eonsidered as 
low ereativity. Therefore, we ean add a temporal prior to the likelihood as 

S''{Pj,Pi) = Pr%pj\pi,a) ■ Pr\pj\pi), 
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where the seeond probability is a temporal likelihood (what is the likelihood that pj is 
influeneed Pi given their dates) and the first is the visual likelihood. There are different ways 
to define sueh a temporal likelihood. The simplest way is a temporal window funetion, i.e., 
Pr^{pj \pi) = 1 if Pi is within K temporal neighbors prior to pj and 0 otherwise]^ 

Balancing Function: There are different ehoiees for the balaneing funetion B{w), as 
well as the parameter for that funetion. We mainly used a linear funetion for that purpose. 
The parameter m ean be set globally over the whole graph, or loeally for eaeh time period. 
A global m ean be set as the p-pereentile of the weights of the graph, whieh is p-pereentile 
of all the pairwise likelihoods. This direetly means that p% of the edges of the graph will be 
reversed when eonstrueting the Creativity Implieation Graph. One disadvantage of a global 
balaneing funetion is that different time periods have different distributions of weights. This 
suggests using a loeal-in-time balaneing funetion. To aehieve that we eompute m* for eaeh 
node as p% of the weight distribution based on its temporal neighborhood. 


5 Experiments and Results 

5.1 Datasets and Visual Features 


Artchive: This dataset was previously used for style elassifieation and influenee diseov- 
ery [ Saleh et al.[ 20141. It contains a total of 1710 images of art works (paintings and 
sculptures) by 66 artists, from 13 different styles ranging from AD 1412 to 1996, chosen 
from Mark Harden’s Artchive database of fine-art [Harden |. The majority of these images 
are of the full work, while a few are details of the work. 

Wikiart.org: We used the publicly available dataset of “Wikiart paintings’^ which, 
to the best of our knowledge, is the largest online public collection of artworks. This 
collection has images of 81,449 fine-art paintings and sculptures from 1,119 artist spanning 
from 1400 till after 2000. These paintings are from 27 different styles (Abstract, Byzantine, 
Baroque, etc.) and 45 different genres (Interior, Landscape, Portrait, etc.). We pruned the 
dataset to 62,254 western paintings by removing genres and mediums that are not suitable 
for the analysis such as sculpture, graffiti, mosaic, installation, performance, photos, etc. 

For both datasets the time annotation is mainly the year. Therefore, it is not possible to 
tell which is prior between any pair of paintings with the same year of creation. Therefore 
no edge is added between their corresponding nodes. 

We experimented with different state-of-the-art feature representations. In particular, 
the results shown here are using Classeme features [jTorresani et al.[ |2010|. These fea- 


^Alternatively, a Gaussian density can be use, PF{pj\pi) = eyip{—\t{pi)—t{pj)Y/af). However, adding 
such temporal Gaussian would complicate the algorithm since it will not be easy to estimate a suitable at, 
specially the graph can have non-uniform density over the time line. 

^http://www. wiki art.org/ 
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tures were shown to outperform other state-of-the-art features for the task of style clas- 


sifieation [ |Saleh et aL| |2014| . These features (2659 dimensions) provide semantie-level 
representation of images, by eneoding the presence of a set of basic-level object categories 
(e.g. horse, cross, etc.), which captures the subject matter of the painting. Some of the 
low-level features used to learn the Classeme features also capture the composition of the 
scene. We also experimented with GIST features, which mainly encode scene decompo¬ 
sition along several perceptual dimensions (naturalness, openness, roughness, expansion, 
ruggedness) [ [Oliva and Torralb^ |2001[ . GIST features are widely used in the computer 
vision literature for scene classification. 


5.2 Example Results 

We show qualitative and quantitative experimental results of the framework applied to the 
aforementioned datasets. As mentioned in the introduction, any result has to be evaluated 
given the set of paintings available to the algorithm and the capabilities of the visual fea¬ 
tures used. Given that the visual features used are mainly capturing subject matter and 
scene composition, sensible creativity scores are expected to reflect these concept. A low 
creativity score does not mean that the work is not creative in general, it just means that 
the algorithm does not see it creative with respect to its encoding of subject matter and 
composition. 

Figures shows the creativity scores computed for the Artchive datasej^ In this fig¬ 
ure and all following figures we plot the scores vs. the year of the painting. The figures 
visualize some of the paintings that obtained high scores, as well as some with low scores 
(the scores in the plots are scaled). We randomly sampled points with low scores for vi¬ 
sualization. A close look at the paintings that scored low (bottom of each plot) reveals the 
presence of typical subject matter, or in some cases the image presents an unclear view of 
a sculpture (e.g. Rodin 1889 sculpture in the bottom right of Figure [^. 

There are several interesting paintings that achieved high creativity scores. For exam¬ 
ple, the scream by Edvard Munch’s (1893) scored very high relative to other paintings in 
that period (see Figure [^. This painting is considered as the second iconic figure after 
Leonardo’s Mona Lisa in the history of art, and it is known to be the most-reproduced 


painting in the twentieth century [Johnson 2003 j. It is also one of the most outstanding 
expressionist paintings. 

Figure 1^ shows a zoom-in plot to the period between 1850-1950, which is very dense 
in the graph of Figure We can see that Picasso’s La Celestina (1903) scored the highest 
among his blue-period paintings. Picasso’s Ladies of Avignon (1907) sticks out as high in 
creativity (obtained the highest score between 1904-1911). Art historians indicate that the 
flat picture plane and the application of primitivism in this painting made it an innovative 


■^For Figurea temporal prior was used. We set K=500, a=0.15. 
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work of art, which lead to Picasso’s cubism [Cooper , 19711. We can notice a sharp increase 
in creativity scores at 1912, dominated by cubism work, with Picasso’s Maquette for Guitar 
(1912) is the highest scoring in that surge. The up trend in creativity scores continues 
with several of Kasimir Malevich’s first Suprematism paintings in 1915 topping the scores. 
This includes Malevich’s Red square (1915), Airplane Flying (1915) and Black and Red 
square (1915) with almost identical scores at the top of this group, followed by Suprematist 
Construction (1915), Two-dimensional Self Portrait (1915), and Supermatist Composition 
(1915) - See Figure[^(the thumbnails of the last three paintings are not shown in the figure). 
Malevich’s 1915’s Black Square was not included in the analyzed collection that is used 
for this plot. However, his 1929’s version of the Black Square was part of the collection 
and scored as high (see the blue star around the year 1929 in Figure |^. The majority 
of the top-scoring paintings between 1916 and 1945 were by Piet Mondrian and Georgia 
O’Keeffee. 

One of the interesting findings is the ability of our algorithm to point out wrong anno¬ 
tations in the dataset. For example, one of the highest scoring paintings around 1910 was 
a painting by Piet Mondrian called “ Composition en blanc, rouge et jaune,” (see the red- 
dotted-framed painting in Figure]^. By examining this painting, we found that the correct 
date for it is around 1936 and it was mistakenly annotated in the Artchive dataset as 191(Q 
Modrain did not start to paint in this grid-based (Tableau) style untill around 1920. So it 
is no surprise that wrongly dating one of Mondrain’s tableau paintings to 1910 caused it 
to obtain a high creativity score, even above the cubism paintings from that time. On the 
Wikiart dataset, one of the highest-scored paintings was “Tornado” by contemporary artist 
Joe Goode, which was found to be mistakenly dated 1911 in Wikiarj^ A closer look at the 
artist biography revealed that he was bom in 1937 and this painting was created in 199l|^ 

It is not surprising for a painting that was created in 1991 to score very high in creativity 
if it was wrongly dated to 1911. These two example, not only indicate that the algorithm 
works, but also show the potential of proposed algorithm in spotting wrong annotations in 
large datasets, which otherwise would require tremendous human effort. 

Figurej^shows the creativity scores obtained for 62K paintings from the W iki art dataset^ 
Similar to the figures above, we plot the scores vs. the year of the painting. We also ran¬ 
domly sampled points with low scores for visualization. The general trend in Figure 
shows peaks in creativity around late 15th to early 16th century (the time of High Renais¬ 
sance), the late 19th and early 20th centuries, and a significant increase in the second half 
of the 20th century. 


^The wrong annotation is in the Artchive CD obtained in 2010. The current online version of Artchive 
has corrected annotation for this painting 

^http : / /WWW. wikiart. org/en/ joe-goode/tornado-1911 - accessed on Feb 28th, 2015 
'http://WWW.artnet.com/artists/joe-goode/tornado-9-2Y7erPME95YlkhFp7DRWlA2 

^For Figurej^no temporal prior was used. We set K=500, a=0.15. 
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Figure 3: Zoom in to the period of 1850-1950 from Figure Each point represents a 
painting. The horizontal axis is the year the painting was created and the vertical axis 
is the creativity score (scaled). Only artist names and dates of the paintings are shown 
on the graph because of limited space. The red-dotted-framed painting by Piet Mondrain 
scored very high because it was wrongly dated in the dataset to 1910 instead of 1936. See 
Section [5^ for a detailed explanation. 
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Figure 4: Creativity scores for 62K painting from the Wikiart dataset. The horizontal axis 
is the year the painting was created and the y^tical axis is the scaled creativity score. 
















Originality vs. Influence - Analysis of Religious Paintings 

In this experiment we investigate the effeet of the two criteria of creativity: originality vs. 
influence. For this purpose we use the formulation in Eq In this experiment we used 
the religious paintings from the Wikiart dataset. This subset contains 5256 paintings in the 
period AD 1410-1993. 

Figure |^ & |^ shows the creativity scores for this subset, where we set the parameter 
(3 = 0.9 to obtain the scores in Figure |^( i.e., emphasizing originality) vs. setting the 
parameter (3 = 0.1 to obtain the scores in Figure (i.e., emphasizing influence). From 
the figures we can notice that emphasizing originality biases the scores towards modern 
paintings, while emphasizing influence biases the scores towards earlier paintings in the 
collection. Comparing the same painting in the two figures can contrast its novelty vs. 
its influence. For example, Francisco Goya’s Crucified Christ (1780) scored very high 
in Figure indicating its originality, and scored lower in Figure when measuring its 
influence. However, in both cases that painting gets higher creativity scores than other 
paintings from the same period. 

It is clear that emphasizing originality results in an monotonically increasing upper 
envelop in the plot at the period from 1400 until around 1520 (see Figure [^. This means 
that in this period there is a clear trend of increasing originality, where some paintings are 
pushing the upper envelop of the plot monotonically up. This up trend ends in the plot 
around 1520, which coincides with the end of the High Renaissance and the beginning 
of the Mannerism movement. An interesting example of the paintings in the up trend 
of originality between 1400-1520 is Andrea del Castagno’s 1447 Last Suppeij^ which is 
the earliest painting depicting the Last Supper in the analyzed collection (see Figure [^. 
Domenico Ghirlandaio’s 1476 last suppeip^ scored higher along the upper envelop of the 
plot. In contrast, other versions of the Last Supper in the collection scored relatively lower, 
including Leonardo da Vinci’s famous fresco. Out of 18 paintings by da Vinci in this 
collection his St. John the Baptist (1515) scored the highest (see Figure]^. In the modern 
era, some of the paintings that scored very high in this religious collection are by Marc 
Chagall, Fernando Boetro, Salvador Dali and Nicholas Roerich (see Figures and for 
details). 

Two-dimensional Creativity - Analysis of Portrait Paintings 

Figure shows an example of two-dimensional analysis of creativity. In this experiment 
we used the subset of portrait paintings from the Wikiart dataset, which contains 12310 
painting from the period AD 1420-2011. We analyzed creativity using the Classeme and 

fresco located at the church of Sant’Apollonia in Florence 
^°A Fresco located in the abbey of San Michele Arcangelo a Passignano in Tavemelle Val di Pesa, near 
Florence 


16 



Hans Holbein the Younger 1521 


Caravaggio 

1610 



Marc Chagall 
1973 

Marc Chagall 
1974 

Marc Chagall 
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Andrea del^^ Flippo Lippi Benozzo Gozzoli Hans Holbein Jusepe de Dante Gabriel 


Castagno1447 


1430 


1461 


1522 


Ribera 1648 Rossetti 1849 


Figure 5: Creativity scores for 5256 religious paintings from the Wikiart dataset (AD 1410- 
1993), emphasizing originality in computing the creativity sores. The horizontal axis is the 
year the painting was created and the vertical axis is the scaled creativity score. 


17 



































Benozzo Gozzoli Luca Signorelli Albrecht Durer Marc Chagall Fernando Botero Marc Chagall Marc Chagall 



Thomas Cole S;^lv;^dor D;^li 
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Figure 6: Creativity scores for 5256 religious paintings from the Wikiart dataset (AD 1410- 
1993), emphasizing influence in computing the creativity sores. The horizontal axis is the 
year the painting was created and the vertical axis is the scaled creativity score. 
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GIST features as explained earlier, which yields two dimensions of creativity coordinates. 
Each point in the plot represents a single painting with two creativity scores. Unlike the 
previous figures, where we showed creativity vs. time, here we mainly show absolute 
creativity with respect to the two dimensions, i.e., we can not judge the relative creativity 
at any point of time from this plot. This makes the plot biased towards visualizing modern 
paintings. It is clear from the plot that the horizontal axis correlates with abstraction in the 
shape and form, while the the vertical axis correlates with texture and pattern. 

5.3 Time Machine Experiment 


Table 1: Time Machine Experiment 


Art movement 

avg % gain/loss 

%inerease 

Moving baekward to AD 1600 

Neoelassieism 

5.78%±1.28 

97%±4.8 

Romantieism 

7.52%± 2.04 

98%± 4.2 

Impressionism 

14.66%± 2.78 

99%±3.2 

Post-Impressionism 

16.82%±2.22 

99%±3.1 

Symbolism 

15.2%±2.94 

97%±4.8 

Expressionism 

16.83%±2.43 

98%±4.2 

Cubism 

13.36%±2.43 

89%±9.9 

Surrealism 

12.66%±1.82 

95%±7.1 

Ameriean Modernism 

11.75%±2.99 

84%±8.4 

Wandering around to AD 1600 

Renaissanee 

0.68 %± 2.05 

39%±5.7 

Baroque 

2.85%± 1.09 

71%±19.7 

Moving forward to AD 1900 

Renaissanee 

-8.13%±2.02 

20% ±10.5 

Baroque 

-10.2%±2.03 

0%±0 


Given the absence of ground truth for measuring creativity and the aforementioned 
wrong time annotations inspired us with a methodology to quantitatively evaluate the frame¬ 
work. We designed what we call “time machine” experiment, where we change the date of 
an artwork to some point in the past or some point in the future, relative to its correct time 
of creation. Then we compute the creativity scores using the wrong date, by running the 
algorithm on the whole data. We then compute the gain (or loss) in the creativity score of 
that artwork compared to its score using correct dating. What should we expect from an 
algorithm that assigns creativity in a sensible way? moving a creative painting back in his¬ 
tory would increase its creativity score, while moving a painting forward would decrease 
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Figure 7: Two dimensional ereativity seores for 12310 portrait paintings from the Wiki art 
dataset, ranging from 1420 until 2011. 
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its creativity. Therefore, we tested three settings: I) Moving baek to AD 1600: For styles 
that date after 1750, we set the test paintings baek to a random date around 1600 using 
Normal distribution with mean 1600 and standard deviation 50 years (i.e. A^(1600, 50^) ). 
II) Moving forward to AD 1900: For the Renaissanee and Baroque styles, we set the test 
paintings to random dates around 1900 sampled from A^(1900,50^). Ill) Wandering about 
AD 1600 (baseline): In this experiment, for the Renaissanee and Baroque styles, we set the 
test paintings to random dates around 1600 sampled from A^(1600, 50^). 

Tableshows the results of these experiments. We ran this experiment on the Artehive 
dataset with no temporal prior. In eaeh run we randomly seleeted 10 test paintings of a 
given style and applied the corresponding move. We used 10 as a small percentage of 
the dataset (less than 1%), not to disturb the global distribution of ereativity. We repeated 
eaeh experiment 10 times and reported the mean and standard deviations of the runs. For 
eaeh style we eomputed the average gain/loss of ereativity seores by the time move. We 
also eomputed the pereentage of the test paintings whose seores have inereased. From the 
table we elearly see that paintings from Impressionist, Post-Impressionist, Expressionist, 
and Cubism movements have signifieant gain in their ereativity seores when moved baek to 
1600. In eontrast, Neoelassieism paintings have the least gain, whieh makes sense, beeause 
Neoelassieism ean be eonsidered as revival to Renaissanee. Romantieism paintings also 
have a low gain when moved baek to 1600, whieh is justified beeause of the eonneetion 
between Romantieism and Gothieism and Medievalism. On the other hand, paintings from 
Renaissanee and Baroque styles have loss in their seores when moved forward to 1900, 
while they did not ehange mueh in the wandering-around-1600 setting. 


6 Conclusion and Discusion 

The paper presented a eomputational framework to assess ereativity among a set of prod- 
uets. We showed that, by eonstrueting a creativity implieation network, the problem re- 
duees to a traditional network eentrality problem. We realized the framework for the do¬ 
main of visual art, where we used eomputer vision to quantify similarity between artworks. 
We validated the approaeh qualitatively and quantitively on two large datasets. 

The most important eonelusion of this work is that, when introduced with a large eol- 
leetion of paintings (and seulptures), the algorithm ean sueeessfully highlight paintings that 
is eonsidered ereative (original and influential). The algorithm aehieved that without any 
knowledge about art or art history eneoded in its input. In most oases the results of the 
algorithm are pieoes of art that art historians indeed highlight as innovative and influential. 
The algorithm aehieved this assessment by visual analysis of paintings and oonsidering 
their dates only. 

Besides this qualitative evidenoe, we also proposed a methodology for validating the 
results of the algorithm through what we denote as time maehine experiments. This exper- 
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iments quantitively validated the proposed algorithm. 

In this paper we focused on “creative” as an attribute of a product, in particular artistic 
products such as painting, where creativity of a painting is defined as the level of its origi¬ 
nality and influence. However, the computational framework can be applied to other forms 
such as sculpture, literature, science etc. Quantifying creativity as an attribute of a product 
facilitates quantifying the creativity of the person who made that product, as a function 
over the creator’s set of products. Hence, our proposed framework also serves as a way to 
quantify creativity as an attribute for people. 

Clearly, it is not possible to judge creativity based on one specific aspect, e.g. use of 
color, perspective, subject matter, etc. For example it was the use of perspective that char¬ 
acterized the creativity at certain point of art history, however it is not the same aspect for 
other periods. This highly suggests the need to measure creativity along different dimen¬ 
sions separately where each dimension reflects certain visual aspects that quantify certain 
elements of art. The proposed framework can be used with multiple artistic concepts to 
achieve multi-dimensional creativity scoring. 
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